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Abstract 

Levels of onnega-6 (n-6) and omega-3 {n-3), long chain polyunsaturated fatty acids (LcPUFAs) such as arachidonic acid (AA; 
20:4, n-6), elcosapentaenoic acid (EPA; 20:5, n-3) and docosahexaenoic acid (DHA; 22:6, n-3) impact a wide range of 
biological activities, including immune signaling, inflammation, and brain development and function. Two desaturase steps 
(A6, encoded by FADS2 and A5, encoded by FADS!) are rate limiting in the conversion of dietary essential 18 carbon PUFAs 
(ISC-PUFAs) such as LA (18:2, n-6) to AA and ot-linolenic acid (ALA, 18:3, n-3) to EPA and DHA. GWAS and candidate gene 
studies have consistently identified genetic variants within FADS! and FADS2 as determinants of desaturase efficiencies and 
levels of LcPUFAs in circulating, cellular and breast milk lipids. Importantly, these same variants are documented 
determinants of important cardiovascular disease risk factors (total, LDL, and HDL cholesterol, triglycerides, CRP and 
proinflammatory eicosanoids). FADSl and FADS2 lie head-to-head (5' to 5') in a cluster configuration on chromosome 11 
(llql2.2). There is considerable linkage disequilibrium (LD) in this region, where multiple SNPs display association with 
LcPUFA levels. For instance, rsl74537, located ~15 kb downstream of FADSl, is associated with both FADSl desaturase 
activity and with circulating AA levels (p-value for AA levels = 5.95x10"''^) in humans. To determine if DNA methylation 
variation impacts FADS activities, we performed genome-wide allele-specific methylation (ASIVl) with rsl 74537 in 144 
human liver samples. This approach identified highly significant ASM with CpG sites between FADSl and FADS2 in a putative 
enhancer signature region, leading to the hypothesis that the phenotypic associations of rsl 74537 are likely due to 
methylation differences. In support of this hypothesis, methylation levels of the most significant probe were strongly 
associated with FADSl and, to a lesser degree, FADS2 activities. 
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Introduction 

Association studies have identified liundreds of relationships 
between single nucleotide polymorphisms (SNPs) and phenotypic 
traits [1]. However, only a few studies have clearly determined the 
causal mechanisms that underlie observed SNP-trait associations. 
Variants may affect gene expression in several ways, including 
altering the regulatory landscape (e.g., promoter or enhancer) of a 
gene, alternative splicing, transcript degradation, and transcription 
of non-coding RNA. However, associations are typically due to a 
genetic variant in linkage disequilibrium (LD) with the associated 
SNP. Alternatively, an epigenetic alteration may be responsible for 
the observed biological variation. Recent genome-scale mapping 
of DNA methylation suggests that methylation impacts gene 



transcription via a variety of mechanisms, depending on its 
location (i.e., transcriptional start sites, gene bodies, regulatory 
elements and repeat sequences) within the transcriptional unit [2] . 

Long chain PUFAs (LcPUFAs) can be synthesized from 
essential, 18 carbon PUFAs (18C-PUFAs) found in the human 
diet, primarily as vegetable oil products (soybean, corn, palm, and 
canola oils as well as margarine and shortenings) (Figure 1). There 
has been a dramatic (3-4 fold) increase in the dietary levels of the 
omega-6 18C-PUFA, Unoleic acid (to 6-8% of daily energy 
consumed) in the past 50 years, as well as an intense debate about 
the health impact of this rise [3] . Much of this discussion centers 
on a difference in opinion with regard to the degree that humans 
have the capacity to convert 18C-PUFAs to LcPUFAs. Once 
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LcPUFAs are formed, they are then incorporated into glycer- 
olipids found in lipoprotein particles in cells and tissues (Figure 1). 
Once in cellular membranes, LcPUFAs can be liberated (typically 
after immunologic activation) as free fatty acids by a family of 
phospholipases (PLAg), diacylglyceride, and monoacylglyceride 
lipases and converted to a large family of eicosanoid products 
(including prostaglandins, thromboxanes, leukotrienes and lipox- 
ins) [4]. In general, these eicosanoids act like local hormones to 
regulate acute and chronic inflammation in numerous human 
diseases. 

First cloned in 1999 [5], the A5 {FADSl) and A6 {FADS2) 
desaturases are encoded by members of a gene family cluster 
{FALSI, FADS2 and FADS3) localized to a 100 kb region on 
chromosome 11 (llql2.2) [5,6]. Speculated to have arisen during 
human evolution by gene duplication, FADS 1-3 have a high 



degree of sequence identity (62-70%), almost identical intron/ 
exon organization [5] and are highly conserved between species. 
Numerous studies have identified associations between genetic 
variants in FADSl and FADS2 and LcPUFA levels, as well as 
precursor-product desaturase activities, with littie evidence for 
genetic loci outside of 1 lql2.2 (reviewed in Lattka et al, 2009 [7]). 
Other studies have found strong associations between variants in 
the FADS cluster with total, LDL, and HDL cholesterol, 
triglycerides, phospholipids, C-reactive protein, proinflammatory 
eicosanoids, and cadiovascular disease endpoints[8-13]. 

While much of the aforementioned research has been carried 
out in populations of European descent, recent evidence docu- 
ments consistent effects in populations of African ancestry [14,15]. 
The extensive LD in European Americans is considerably less in 
African Americans, and rs 174537 is the SNP with the strongest 
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Figure 1. LcPUFA biosyntKiesis from dietary essential PUFAs Pathway. Key enzymes and metabolites are shown, and the roles oi FADSl (red) 

and FADS2 (blue) are indicated. 
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Figure 2. Manhattan plots for association of rs1 74537 with HumanMethylation450 sites. P-values are based on the genetic trend test and 
adjusted for age, chip, and chip position. The blue line indicates the Bonferroni-adjusted level of significance of 1 .03 xlO"' (0.05/485,000). The upper 
panel shows results from the meta-analysis (using European Americans and African-Americans), the middle panel shows European Americans only, 
and the lower panel shows African-Americans only. 
doi:10.1371/journal.pone.0097510.g002 



evidence for association with AA levels [14,16]. rsl74537 is 
located within the 3' end of MIKF (myelin regulatory factor), but 
is also within a haplotype block spanning ~30 kb (based on the 
1000 Genomes Project) that includes FADSl and coincides with 
the peak positive selection signal. This block overlaps completely 
with the peak association signal with LcPUFAs in African 
Americans [14], and includes rsl74537 as well as three known 
eQTL SNPs for FADSl [17] (rsl74547, rsl74548 and rsl74549). 
Importantly, there are dramatic differences in the frequency of the 
allele (G) associated with increased levels of LcPUFAs and FADS 1 



activity in African American populations [14,15] studied to date, 
and most notably, this allele is fixed in African populations [18]. 
There is strong evidence across multiple sources, including the 
1000 Genomes Project and the Human Genome Diversity Panel, 
that this fixation is likely to have occurred as a result of positive 
selection approximately 85,000 years ago [18]. 
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Results and Discussion 

Given recent evidence that variation in DNA methylation can 
be associated with cis genetic variation [19,20], we performed a 
genome-wide, allele-specific methylation (ASM) analysis with 
rsl74537. Our hypothesis was that rsl74537 may serve as a 
genetic proxy for DNA methylation within the FADS cluster. As 
liver is known to be a primary organ involved in LcPUFA 
biosynthesis, DNA was extracted from 144 liver samples obtained 
from the Pathobiological Determinants of Atherosclerosis in Youth 
(PDAY) study [21]. Briefly, PDAY was an autopsy study designed 
to examine the pathogenesis of atherosclerosis in young people. A 
subset of the total population, selected for a separate study to 
examine non-HDL cholesterol, consisted of subjects with the 
lowest 25''' (controls) and highest 10''' (cases) percentile of non- 
HDL cholesterol. Samples were from 72 European American and 
72 African- American males, 15 to 34 years of age, who died of 
violent causes within 72 hours after injury and underwent autopsy 
in one of the cooperating medical examiners' laboratories. 
rsl74537 genotypes were obtained as part of a custom genotyping 
panel of 77 SNPs, utilizing the Sequenom iPlex genotyping system 
(see Methods). To determine if ASM associations existed 
throughout the genome, all probes represented on the lUumina 
HumanMethylation450 BeadChip (485,577 CpG sites) were tested 
for association with rs 174537. Only one region, on chromosome 
llql2.2, showed a marked ASM association with rsl74537 in 
both ethnicities; meta-analysis was performed combining both 
ethnicities, accounting for sample size and direction of effect 
(Figure 2). The most significant association was observed with the 
methylation probe cg27386326 (p = 2.69x10"^^ for the meta- 
analysis; p = 3.66x10 ^"^ in European Americans and 
p= 1.78x10 in African Americans), located approximately 
3.5 kb from the FADSl transcription initiation site. As with 
previous association reports with rs 174537 and LcPUFA levels, the 
effect size of rs 17453 7 on methylation status was similar between 
the two groups (P= — 0.19±0.01 in European Americans and 
P=— 0.26±0.04 in African Americans). Four other sites - 
cgl6213375 (p = 9.76x10"'^), cgl0515671 (p = 6.93x10"'*), 
cg03805684 (p = 6.22x10"'°), and cgl9610905 (p = 3.09x10"") 
— reached a Bonferroni-adjusted level of significance (i.e., 0.05/ 
485,000= 1.03x10"'; blue line in Figure 2). Validation of the 
original finding was performed using pyrosequencing of the 
cg27386326 CpG site, with the probe CG27386326_04 (Qiagen, 
Inc.). DNA methylation of the locus with pyrosequencing was 
strongly correlated (r^ = 0.81) with the BeadChip450 k result. To 
ensure that selection bias was not affecting our results in this 
population, we performed a simple association analysis with 
rs 1745 3 7 and case-control status based on the non-HDL 
phenotype. No evidence of association was observed (p = 0.71, 
additive model, race-adjusted). 

Closer examination showed that these methylation loci are 
located in a likely regulatory region 5 ' of both FADSl and FADS2 
(Figure 3). ENCODE regulation tracks revealed that the highly 
associated methylation region is located in a region that has a 
potential "enhancer signature," namely monomethylation of 
histone H3 lysine4 (H3K4mel) and acetylation of H3K27 
(H3K27Ac; lower zoomed portion of Figure 3). In addition, this 
site is in or near transcription factor binding sites for c-Fos, 
STAT3 and MafK. 

To better determine the putative role of cg27386326 methyl- 
ation and FADS activity, we measured fatty acid levels within the 
liver samples and examined association between methylation levels 
and the key surrogate measures of FADSl and FADS2 activities, 
the ratios of AA to DGLA and GLA to LA levels, respectively. 



There was a strong association between methylation status of 
cg27386326 and AA/DGLA (FADSl activity) in liver tissues 
(ANOVA p = 3.99x10""; Figure 4). GLA/LA (FADS2 activity) 
showed a more modest relationship with methylation status 
(p = 0.0067; Figure 4). It is worth noting that statistical differences 
in associations between FADSl and FADS2 activities may center 
around the difficulty of approximating FADS2 activity by 
measuring fatty acid levels within liver tissue. Here, the newly- 
formed product of the FADS2 reaction, GLA, is rapidly converted 
to DGLA \ia elongation and can no longer be measured, thus 
underestimating its activity. In contrast, the product of the FADSl 
reaction, AA, accumulates in cells and tissues such as the liver. 

Figure 4 also illustrates the distribution of rs 1745 3 7 genotypes 
among the samples, with GG, GT and TT clearly associated with 
diflFerent levels of methylation at cg27386326. This methylation 
proportion differed by 40% between homozygotes at rsl74537 
(44 'M), GG genotype; 84%, TT genotype). To our knowledge, this 
magnitude of allele-specific or allele-associated methylation has 
not been observed outside of the imprinting and X-chromosome 
inactivation context. 

One possible explanation for the association between the 
genetic variant rsl74537 and cg27386326 methylation level is 
that a SNP in LD with rsl74537 exists at the targeted cg27386326 
CpG site. Such a methylation-altering SNP, known as an mSNP, 
would lead to an apparent change in DNA methylation that was 
actually due to alternate genotypes at the CpG site [22]. To rule 
out this possibility, we sequenced the region encompassing the 
cg27386326 probe in the study population and found no SNP 
within the 50bp probe sequence or the targeted CpG site. 

Both genotype and the related methylation were associated with 
FADS activities in the liver, but the effect of genotype nearly goes 
away once methylation is accounted for. Taken together, these 
data suggest that rsl74537, and SNPs in LD with rsl74537, may 
serve as genetic markers for the methylation status of one or more 
critical CpG sites in the FADS gene cluster. This possibility is 
made more interesting by the noteworthy location of these sites, 
specifically in a region between the promoters for FADSl and 
FADS2 and within a putative enhancer histone modification 
signature. These data also raise the key question of whether both 
genes are regulated by a single mcclianism, the methylation of an 
enhancer region between two proximal promoters. 

This study has several strengths and limitations. Given the 
nature of the PDAY study, we have been able to examine 
methylation levels in a highly relevant tissue for PUFA metabo- 
lism, and correlate those levels with a SNP that has been 
associated in population studies. Unfortunately, gene expression 
levels could not be measured, due to mRNA degradation of these 
autopsy samples. However, a recent study analyzing cQTL in fiver 
samples from 427 Caucasian subjects showed that rsl 74548, 
which is in strong LD with rsl 74537 in the CEU population 
(r^ = 0.81 in the 1000 Genomes Project) was highly associated with 
two probes for FADSl expression (p= 1.52x10"* and 1.74x10"^) 
[17]. These data suggest that the impact of rsl74537 on FADSl 
activity is altered via transcription, raising the important question 
of how methylation of this enhancer region impacts expression of 
genes in the FADS cluster. Certainly, recent studies reveal that 
enhancers located at variable distances from promoters can play a 
key role in regulating gene expression [2] . While they are typically 
in CpG-poor regions, whole-methylome analysis has shown them 
to have highly variable levels of methylation [23]. Moreover, 
several recent studies have linked the methylation status of an 
enhancer region and enhancer function, but the interaction 
between methylation and transcription factor binding is still poorly 
understood [24,25]. 



PLOS ONE I www.plosone.org 



4 



May 2014 I Volume 9 | Issue 5 | e97510 



FADS Methylation Is Associated with Activity 



25- 



5- 



rsl74537 



• cg27386326 



MYRF- 
MYRF- 



FEN1 I 



TMEM258 (-ff 



FADS1 



MiRSH 



MIR1908I 
I I II I 



FADS2 <~ 



Scale 
chrll: 
FADS1 - 

cg23992449 
cg271 73322 
cg051 68842 



2kb[- 



j hg19 

I 61.584,500| 61,585,000| 61,585,500| 61.586,000| 61,58S,500| 61,587,000| 61.587,500| 61.588,000| SI, 588,500] 61,589,000| 



48.8 

Layered H3K4Me1 
0.04. 
194.84 
Layered H3K27Ac 
0.04. 
215.12 
Layered H3K4Me3 

0.04 

Txn Factor ChlP|. 



cg020B5160 
cg09462826 

cg25837350 | 

cg06405978 
cg06g85934 
cg13121120 
cgl 621 3375 



cgl 051 5671 I 
cg07709195 



cg27386326 1 





c-Fos 
STAT3 ■ 
MatK_(ab50322) 



GR 



Figure 3. Association of f/lDS cluster CpG sites with rsl 74537. The lower panel shows the location of the CpG sites relative to the ENCODE 
data for H3K4Me1, H3K27Ac, and H3K4iVle3 histone marks, respectively, which indicate promoter and/or regulatory regions in seven cell lines. The 
peak methylation site is shown in red, and the other two sites from the region that met Bonferroni adjustment are shown in blue. 
doi:1 0.1 371 /journal.pone.00975 1 0.g003 



We propose a pathway in which methylation of cg2 7 386326 
(and nearby loci) plays a key causal role in the formation of 
LcPUFA-containing lipids, cholesterol, triglyceride and eicosanoid 
formation, and ultimately clinical endpoints associated with 
human disease (Figure 5). As described above, there has been a 
dramatic increase in human exposure to n-6 18C-PUFA over the 
past 50 years. Additionally, it is clear that certain SNPs within the 
FADS cluster, such as rsl 74537, are strongly associated with the 
degree to which IBC-PUFAs are converted to LcPUFAs and the 
formation of LcPUFA-containing lipids [3,14,15,18], Perhaps it is 
not surprising that many of these same variants are strongly 
associated with biomarkers of human disease, including total 



cholesterol, LDL-cholesterol, HDL-cholesterol, triglycerides and 
eicosanoids, given that LcPUFAs play a key role in the biosynthesis 
of each. This study suggests that rsl 74537 is a strong genetic proxy 
for DNA methylation in a putative enhancer region of the FADS 
cluster, and that the methylation status of these loci may direcdy 
impact FADSl and FADS2 activity. 

Materials and Methods 

Study Samples 

Liver samples were obtained from the Pathobiological Deter- 
minants of Atherosclerosis in Youth (PDAY) study [2 1] , PDAY 
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was an autopsy study designed to examine the pathogenesis of 
atherosclerosis in young people. A subset of the total population, 
selected for a separate study, consisted of subjects with the lowest 
25* (controls) and highest 10''^ (cases) percentile of non-HDL 
cholesterol. Samples were from 72 European American and 72 



African- American males, 15 to 34 years of age, who died of violent 
causes within 72 hours after injury and underwent autopsy in one 
of the cooperating medical examiners' laboratories. DNA was 
isolated from liver samples that had been stored at — 80C. Five 
hundred to 700 mg of thawed liver tissues were homogenized with 
a Dispomix Drive (Medic Tools AG, Switzerland) and genomic 
DNA extracted with a MagneSU Genomic, Large Volume system 
(Promega, USA) process that had been automated on a Freedom 
EVO liquid handler (Tecan, Switzerland). Extracted DNA was 
quantitated with PicoGreen reagent (Molecular Probes, USA) and 
verified as high molecular weight (>50 Kb) by agarose gel 
electrophoresis. This study used DNA obtained from liver samples 
acquired at autopsy. Since all study subjects were deceased at the 
time of study, use of these specimens is not considered Human 
Subjects research. 

SNP Genotyping 

A panel of 77 SNPs for genotyping in six fatty acid candidate 
genes [FADSl, FADS2, EL0VL2, EL0VL5, ACADll and ACOXl) 
were selected by using 1) using an r^ threshold of 0.7 in the 
HapMap CEU population, and 2) additional SNPs to guarantee 
tagging at an r" threshold of 0.7 in the HapMap YRI population, 
using the tagger option in Haploview [26]. rsl 74537 was forced to 
be a tagging SNP, due to its extensive associations in the literature. 
SNP genotyping was performed with the Sequenom iPlex 
genotyping system, and only rsl 74537 was included for analysis 
in this study. 

Sequencing Around the cg27386326 Probe Sequence 

DNA probes on the HumanMethylation450 BeadChip consist 
of 50bp fragments, and the subsequent base is the assayed CpG. 
To exclude the possibility that the CpG site included a SNP, we 
sequenced the 50bp probe and immediately surrounding sequence 
using Sanger sequencing. A 724 bp PGR product was amplified 
using the foUowing primers: 5'-ATGATGTAAGTTTGGCTA- 
CAGAGA-3' and 5'-CAATTCAGCAAATTTATGTGGG-3'. 
PGR cycling conditions were 95°G, 5 min; 30 cycles of 95°G for 
30 sec, 56°G for 30 sec, and 72°C for 1 min; followed by a 5 min 
extension at 72°C. Sequencing reactions were performed using the 
ABI BigDye Terminator VI. 1 chemistry. Sequencing products 
were run on an ABI 3730 xL DNA Analyzer (Applied Bioystems, 
Inc., Foster Gity, GA) and analyzed with Sequencher V4.8 
(GeneGodes Corp, Ann Arbor, MI). 
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DNA Methylation Analysis 

Samples were evaluated using the lUumina HumanMethyla- 
tion450 BeadChip, which assays 485,577 unique CpG sites. The 
average beta (essentially the ratio of the methylated to unmethy- 
lated signal) for each site was used to test for differences by 
genotype. Association analysis was performed in the European 
American and African-American samples separately using a 
generalized linear model (proc glm), as implemented in SAS 
(Gary, NC). Age, chip, and chip position were included in the 
model as covariates. Meta analysis was performed using METAL 
[27], weighting by sample size and accounting for direction of 
effect. 



Measurement of Fatty Acids in Liver Samples 

Fatty acids were measured in homogenized liver samples. Tissue 
was homogenized in ice-cold deionized water at 1 00 mg/ ml. Fatty 
acid methyl esters were prepared in tripli[:ate homogenate samples 
(100 |Xl) following a modification of Metcalfe et al. [28] and 
analyzed by gas chromatography as previously described [29]. 
Fatty acids in samples were identified based on retention times of 
commercially available authentic fatty acid methyl esters. 
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